An Attempt to Automatic Thesaurus Construction from an Ordinary Japanese Language Dictionary

نویسندگان

  • Hiroaki Tsurumaru
  • Toru Hitaka
  • Sho Yoshida
چکیده

How to obta in h i e r a r ch i ca l r e l a t i o n s ( e . g , superord ina te -hyponym relation, synonym r e l a t i o n ) is one of the most important problems for thesaurus cons t ruc t ion , A p i lo t system for ex t r ac t ing these r e l a t i o n s au tomat ica l ly from an ordinary Japanese language d i c t i ona ry (Shinmeikai Kokugojiten, published hy Sansei-do, in machine readable form) i s given. The f e a t u r e s of the definition sentences in the d i c t iona ry , the mechanical ex t r ac t i on of the h i e r a r ch i ca l r e l a t i o n s and the es t ima t ion of the r e s u l t s are d i scussed .

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Construction Of A Transfer Dictionary Considering Directionality

In this paper, we show how to construct a transfer dictionary automatically. Dictionary construction, one of the most difficult tasks in developing a machine translation system, is expensive. To avoid this problem, we investigate how we build a dictionary using existing linguistic resources. Our algorithm can be applied to any language pairs, but for the present we focus on building a Korean-to...

متن کامل

5 th International Workshop on Linguistically Interpreted Corpora

In this paper we describe the construction of a new Japanese lexical resource: the Hinoki treebank. The treebank is built from dictionary definition sentences, and uses an HPSG based Japanese grammar to encode the syntactic and semantic information. We show how this treebank can be used to extract thesaurus information from definition sentences in a language-neutral way using minimal recursion ...

متن کامل

The Hinoki Treebank: Working Toward Text Understanding

In this paper we describe the construction of a new Japanese lexical resource: the Hinoki treebank. The treebank is built from dictionary definition sentences, and uses an HPSG based Japanese grammar to encode the syntactic and semantic information. We show how this treebank can be used to extract thesaurus information from definition sentences in a language-neutral way using minimal recursion ...

متن کامل

Relations extracted from a Portuguese dictionary: results and first evaluation

This paper presents PAPEL, a lexical resource for Portuguese, consisting of relations between terms, extracted by (semi) automatic means from a general language dictionary. An overview on the construction process is given, the included relations are presented and a quantitative vision is provided together with some examples. Synonymy relations were evaluated using a thesaurus as a golden standa...

متن کامل

Automatic construction of dictionnaries, application to product characteristics indexing

Summary of Product Characteristics (SPC) indexing enables to extract all the information needed to analyze a prescription and find some inappropriate medications. We evaluate a French Multi-Terminology Indexer tool (F-MTI) for SPC automatic indexing. This tool uses a dictionary containing the textual forms that are likely to appear in natural language text for the drug clinical particular terms...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1986